Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 4335 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 575.9 KiB |
| Average record size in memory | 136.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 6 |
Reproduction
| Analysis started | 2020-05-21 15:05:27.635943 |
|---|---|
| Analysis finished | 2020-05-21 15:05:43.651956 |
| Duration | 16.02 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
name has a high cardinality: 4166 distinct values | High cardinality |
host_name has a high cardinality: 909 distinct values | High cardinality |
last_review has a high cardinality: 763 distinct values | High cardinality |
id is highly correlated with df_index | High correlation |
df_index is highly correlated with id | High correlation |
neighbourhood is highly correlated with neighbourhood_group | High correlation |
neighbourhood_group is highly correlated with neighbourhood | High correlation |
name is uniformly distributed | Uniform |
df_index has unique values | Unique |
id has unique values | Unique |
| Distinct count | 4335 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3605.105420991926 |
|---|---|
| Minimum | 0 |
| Maximum | 7767 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 293.7 |
| Q1 | 1822.5 |
| median | 3580 |
| Q3 | 5349.5 |
| 95-th percentile | 7139.3 |
| Maximum | 7767 |
| Range | 7767 |
| Interquartile range (IQR) | 3527 |
Descriptive statistics
| Standard deviation | 2144.353152 |
|---|---|
| Coefficient of variation (CV) | 0.59481011 |
| Kurtosis | -1.104769175 |
| Mean | 3605.105421 |
| Median Absolute Deviation (MAD) | 1763 |
| Skewness | 0.07922266594 |
| Sum | 15628132 |
| Variance | 4598250.441 |
| Value | Count | Frequency (%) | |
| 4094 | 1 | < 0.1% | |
| 6710 | 1 | < 0.1% | |
| 2644 | 1 | < 0.1% | |
| 4691 | 1 | < 0.1% | |
| 6734 | 1 | < 0.1% | |
| 4679 | 1 | < 0.1% | |
| 6726 | 1 | < 0.1% | |
| 581 | 1 | < 0.1% | |
| 2628 | 1 | < 0.1% | |
| 4675 | 1 | < 0.1% | |
| Other values (4325) | 4325 | 99.8% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7767 | 1 | < 0.1% | |
| 7766 | 1 | < 0.1% | |
| 7752 | 1 | < 0.1% | |
| 7728 | 1 | < 0.1% | |
| 7715 | 1 | < 0.1% |
| Distinct count | 4335 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21990286.141868513 |
|---|---|
| Minimum | 49091 |
| Maximum | 37852422 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 49091 |
|---|---|
| 5-th percentile | 4449307.3 |
| Q1 | 14838743 |
| median | 23065902 |
| Q3 | 30271830.5 |
| 95-th percentile | 35745370.3 |
| Maximum | 37852422 |
| Range | 37803331 |
| Interquartile range (IQR) | 15433087.5 |
Descriptive statistics
| Standard deviation | 9902633.473 |
|---|---|
| Coefficient of variation (CV) | 0.4503185365 |
| Kurtosis | -0.9288682316 |
| Mean | 21990286.14 |
| Median Absolute Deviation (MAD) | 7578008 |
| Skewness | -0.3766415221 |
| Sum | 9.532789042e+10 |
| Variance | 9.806214969e+13 |
| Value | Count | Frequency (%) | |
| 20865016 | 1 | < 0.1% | |
| 33813389 | 1 | < 0.1% | |
| 6628037 | 1 | < 0.1% | |
| 11465411 | 1 | < 0.1% | |
| 15870203 | 1 | < 0.1% | |
| 20040797 | 1 | < 0.1% | |
| 3863231 | 1 | < 0.1% | |
| 33919675 | 1 | < 0.1% | |
| 27794106 | 1 | < 0.1% | |
| 36379318 | 1 | < 0.1% | |
| Other values (4325) | 4325 | 99.8% |
| Value | Count | Frequency (%) | |
| 49091 | 1 | < 0.1% | |
| 50646 | 1 | < 0.1% | |
| 56334 | 1 | < 0.1% | |
| 71609 | 1 | < 0.1% | |
| 71896 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 37852422 | 1 | < 0.1% | |
| 37841266 | 1 | < 0.1% | |
| 37798739 | 1 | < 0.1% | |
| 37690516 | 1 | < 0.1% | |
| 37621650 | 1 | < 0.1% |
| Distinct count | 4166 |
|---|---|
| Unique (%) | 96.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| Luxury hostel with in-cabin locker - Single mixed | 9 |
|---|---|
| Studio Apartment - Oakwood Premier | 9 |
| Superhost 1BR APT in the heart of Tg Pagar | 8 |
| Single Capsule For 1 (Free Breakfast) | 7 |
| Spacious bedroom near city centre with free Wi-Fi | 6 |
| Other values (4161) |
| Value | Count | Frequency (%) | |
| Luxury hostel with in-cabin locker - Single mixed | 9 | 0.2% | |
| Studio Apartment - Oakwood Premier | 9 | 0.2% | |
| Superhost 1BR APT in the heart of Tg Pagar | 8 | 0.2% | |
| Single Capsule For 1 (Free Breakfast) | 7 | 0.2% | |
| Spacious bedroom near city centre with free Wi-Fi | 6 | 0.1% | |
| PROMO!! 1B Deluxe apartment perfect for Staycation | 5 | 0.1% | |
| One Bedroom Deluxe Apartment - Oakwood Premier | 4 | 0.1% | |
| NEW En-suite Bedroom /WIFI near Orchard/Somerset | 4 | 0.1% | |
| Leisure 1BR APT located 5 mins from Tg Pagar MRT | 4 | 0.1% | |
| Pleasant & Comfy Studio APT near Potong Pasir MRT | 4 | 0.1% | |
| Other values (4156) | 4275 | 98.6% |
Length
| Max length | 85 |
|---|---|
| Median length | 40 |
| Mean length | 38.75847751 |
| Min length | 1 |
host_id
Real number (ℝ≥0)
| Distinct count | 1232 |
|---|---|
| Unique (%) | 28.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82279199.43183391 |
|---|---|
| Minimum | 23666 |
| Maximum | 282332472 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 23666 |
|---|---|
| 5-th percentile | 2413412 |
| Q1 | 17526618 |
| median | 47611818 |
| Q3 | 135756776 |
| 95-th percentile | 238891646 |
| Maximum | 282332472 |
| Range | 282308806 |
| Interquartile range (IQR) | 118230158 |
Descriptive statistics
| Standard deviation | 79283170.04 |
|---|---|
| Coefficient of variation (CV) | 0.9635870377 |
| Kurtosis | -0.5047551364 |
| Mean | 82279199.43 |
| Median Absolute Deviation (MAD) | 39119811 |
| Skewness | 0.8949660508 |
| Sum | 3.566803295e+11 |
| Variance | 6.285821052e+15 |
| Value | Count | Frequency (%) | |
| 8492007 | 133 | 3.1% | |
| 29420853 | 117 | 2.7% | |
| 66406177 | 106 | 2.4% | |
| 2413412 | 81 | 1.9% | |
| 23722617 | 78 | 1.8% | |
| 31464513 | 71 | 1.6% | |
| 108773366 | 64 | 1.5% | |
| 14521708 | 63 | 1.5% | |
| 8948251 | 52 | 1.2% | |
| 209913841 | 49 | 1.1% | |
| Other values (1222) | 3521 | 81.2% |
| Value | Count | Frequency (%) | |
| 23666 | 1 | < 0.1% | |
| 59498 | 3 | 0.1% | |
| 165209 | 2 | < 0.1% | |
| 184596 | 1 | < 0.1% | |
| 227796 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 282332472 | 1 | < 0.1% | |
| 282327651 | 1 | < 0.1% | |
| 282120323 | 1 | < 0.1% | |
| 282012486 | 1 | < 0.1% | |
| 281898594 | 1 | < 0.1% |
| Distinct count | 909 |
|---|---|
| Unique (%) | 21.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| Alvin | 170 |
|---|---|
| Aaron | 117 |
| Jay | 115 |
| Alex | 89 |
| Kaurus | 81 |
| Other values (904) |
| Value | Count | Frequency (%) | |
| Alvin | 170 | 3.9% | |
| Aaron | 117 | 2.7% | |
| Jay | 115 | 2.7% | |
| Alex | 89 | 2.1% | |
| Kaurus | 81 | 1.9% | |
| Darcy | 71 | 1.6% | |
| RedDoorz | 64 | 1.5% | |
| Shirley | 64 | 1.5% | |
| Joey | 60 | 1.4% | |
| Richards | 49 | 1.1% | |
| Other values (899) | 3455 | 79.7% |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 5.910495963 |
| Min length | 1 |
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| Central Region | |
|---|---|
| East Region | 266 |
| West Region | 257 |
| North-East Region | 160 |
| North Region | 89 |
| Value | Count | Frequency (%) | |
| Central Region | 3563 | 82.2% | |
| East Region | 266 | 6.1% | |
| West Region | 257 | 5.9% | |
| North-East Region | 160 | 3.7% | |
| North Region | 89 | 2.1% |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 13.7077278 |
| Min length | 11 |
| Distinct count | 40 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| Kallang | |
|---|---|
| Geylang | |
| Outram | 308 |
| Rochor | 301 |
| Novena | 287 |
| Other values (35) |
| Value | Count | Frequency (%) | |
| Kallang | 613 | 14.1% | |
| Geylang | 580 | 13.4% | |
| Outram | 308 | 7.1% | |
| Rochor | 301 | 6.9% | |
| Novena | 287 | 6.6% | |
| Downtown Core | 235 | 5.4% | |
| River Valley | 231 | 5.3% | |
| Bukit Merah | 216 | 5.0% | |
| Bedok | 206 | 4.8% | |
| Queenstown | 115 | 2.7% | |
| Other values (30) | 1243 | 28.7% |
Length
| Max length | 23 |
|---|---|
| Median length | 7 |
| Mean length | 8.287658593 |
| Min length | 5 |
latitude
Real number (ℝ≥0)
| Distinct count | 3117 |
|---|---|
| Unique (%) | 71.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3124298108419838 |
|---|---|
| Minimum | 1.24526 |
| Maximum | 1.45203 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 1.24526 |
|---|---|
| 5-th percentile | 1.279557 |
| Q1 | 1.295815 |
| median | 1.31038 |
| Q3 | 1.318845 |
| 95-th percentile | 1.370646 |
| Maximum | 1.45203 |
| Range | 0.20677 |
| Interquartile range (IQR) | 0.02303 |
Descriptive statistics
| Standard deviation | 0.02843622 |
|---|---|
| Coefficient of variation (CV) | 0.02166685012 |
| Kurtosis | 5.114935631 |
| Mean | 1.312429811 |
| Median Absolute Deviation (MAD) | 0.01222 |
| Skewness | 1.878096428 |
| Sum | 5689.38323 |
| Variance | 0.0008086186079 |
| Value | Count | Frequency (%) | |
| 1.28376 | 7 | 0.2% | |
| 1.31443 | 6 | 0.1% | |
| 1.31137 | 6 | 0.1% | |
| 1.31314 | 5 | 0.1% | |
| 1.31408 | 5 | 0.1% | |
| 1.29488 | 5 | 0.1% | |
| 1.31372 | 5 | 0.1% | |
| 1.30352 | 5 | 0.1% | |
| 1.3145 | 5 | 0.1% | |
| 1.3143 | 5 | 0.1% | |
| Other values (3107) | 4281 | 98.8% |
| Value | Count | Frequency (%) | |
| 1.24526 | 1 | < 0.1% | |
| 1.24853 | 1 | < 0.1% | |
| 1.26478 | 1 | < 0.1% | |
| 1.26513 | 1 | < 0.1% | |
| 1.26569 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.45203 | 1 | < 0.1% | |
| 1.44968 | 1 | < 0.1% | |
| 1.44928 | 1 | < 0.1% | |
| 1.44861 | 1 | < 0.1% | |
| 1.44843 | 1 | < 0.1% |
longitude
Real number (ℝ≥0)
| Distinct count | 3387 |
|---|---|
| Unique (%) | 78.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.85037353633217 |
|---|---|
| Minimum | 103.66547 |
| Maximum | 103.97341999999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 103.66547 |
|---|---|
| 5-th percentile | 103.764189 |
| Q1 | 103.83847 |
| median | 103.85024 |
| Q3 | 103.875515 |
| 95-th percentile | 103.9118 |
| Maximum | 103.97342 |
| Range | 0.30795 |
| Interquartile range (IQR) | 0.037045 |
Descriptive statistics
| Standard deviation | 0.04205745333 |
|---|---|
| Coefficient of variation (CV) | 0.0004049812427 |
| Kurtosis | 2.499454124 |
| Mean | 103.8503735 |
| Median Absolute Deviation (MAD) | 0.0138 |
| Skewness | -0.8255072814 |
| Sum | 450191.3693 |
| Variance | 0.001768829381 |
| Value | Count | Frequency (%) | |
| 103.88788 | 5 | 0.1% | |
| 103.84536 | 5 | 0.1% | |
| 103.84014 | 5 | 0.1% | |
| 103.83863 | 5 | 0.1% | |
| 103.84794 | 5 | 0.1% | |
| 103.8436 | 5 | 0.1% | |
| 103.85192 | 5 | 0.1% | |
| 103.84691 | 4 | 0.1% | |
| 103.85207 | 4 | 0.1% | |
| 103.84296 | 4 | 0.1% | |
| Other values (3377) | 4288 | 98.9% |
| Value | Count | Frequency (%) | |
| 103.66547 | 1 | < 0.1% | |
| 103.68676 | 1 | < 0.1% | |
| 103.68744 | 1 | < 0.1% | |
| 103.68779 | 1 | < 0.1% | |
| 103.68782 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 103.97342 | 1 | < 0.1% | |
| 103.97292 | 1 | < 0.1% | |
| 103.97105 | 1 | < 0.1% | |
| 103.96815 | 1 | < 0.1% | |
| 103.96739 | 1 | < 0.1% |
room_type
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| Entire home/apt | |
|---|---|
| Private room | |
| Shared room | 258 |
| Value | Count | Frequency (%) | |
| Entire home/apt | 2281 | 52.6% | |
| Private room | 1796 | 41.4% | |
| Shared room | 258 | 6.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.51903114 |
| Min length | 11 |
price
Real number (ℝ≥0)
| Distinct count | 290 |
|---|---|
| Unique (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149.0701268742791 |
|---|---|
| Minimum | 14 |
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 65 |
| median | 119 |
| Q3 | 199 |
| 95-th percentile | 368 |
| Maximum | 999 |
| Range | 985 |
| Interquartile range (IQR) | 134 |
Descriptive statistics
| Standard deviation | 116.0404973 |
|---|---|
| Coefficient of variation (CV) | 0.7784289164 |
| Kurtosis | 6.918869806 |
| Mean | 149.0701269 |
| Median Absolute Deviation (MAD) | 59 |
| Skewness | 2.094410009 |
| Sum | 646219 |
| Variance | 13465.39702 |
| Value | Count | Frequency (%) | |
| 60 | 123 | 2.8% | |
| 50 | 109 | 2.5% | |
| 100 | 98 | 2.3% | |
| 119 | 89 | 2.1% | |
| 200 | 89 | 2.1% | |
| 90 | 86 | 2.0% | |
| 56 | 84 | 1.9% | |
| 150 | 84 | 1.9% | |
| 131 | 82 | 1.9% | |
| 69 | 81 | 1.9% | |
| Other values (280) | 3410 | 78.7% |
| Value | Count | Frequency (%) | |
| 14 | 1 | < 0.1% | |
| 18 | 2 | < 0.1% | |
| 19 | 17 | 0.4% | |
| 21 | 4 | 0.1% | |
| 22 | 17 | 0.4% |
| Value | Count | Frequency (%) | |
| 999 | 2 | < 0.1% | |
| 900 | 1 | < 0.1% | |
| 887 | 1 | < 0.1% | |
| 881 | 1 | < 0.1% | |
| 844 | 1 | < 0.1% |
minimum_nights
Real number (ℝ≥0)
| Distinct count | 49 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.011534025374855 |
|---|---|
| Minimum | 1 |
| Maximum | 700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 90 |
| Maximum | 700 |
| Range | 699 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 32.24885722 |
|---|---|
| Coefficient of variation (CV) | 2.478482334 |
| Kurtosis | 78.35121349 |
| Mean | 13.01153403 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.563165133 |
| Sum | 56405 |
| Variance | 1039.988792 |
| Value | Count | Frequency (%) | |
| 1 | 1164 | 26.9% | |
| 2 | 932 | 21.5% | |
| 3 | 603 | 13.9% | |
| 90 | 239 | 5.5% | |
| 5 | 228 | 5.3% | |
| 7 | 212 | 4.9% | |
| 30 | 175 | 4.0% | |
| 4 | 142 | 3.3% | |
| 6 | 133 | 3.1% | |
| 18 | 108 | 2.5% | |
| Other values (39) | 399 | 9.2% |
| Value | Count | Frequency (%) | |
| 1 | 1164 | 26.9% | |
| 2 | 932 | 21.5% | |
| 3 | 603 | 13.9% | |
| 4 | 142 | 3.3% | |
| 5 | 228 | 5.3% |
| Value | Count | Frequency (%) | |
| 700 | 1 | < 0.1% | |
| 365 | 7 | 0.2% | |
| 356 | 1 | < 0.1% | |
| 190 | 1 | < 0.1% | |
| 183 | 3 | 0.1% |
number_of_reviews
Real number (ℝ≥0)
| Distinct count | 202 |
|---|---|
| Unique (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.128950403690887 |
|---|---|
| Minimum | 1 |
| Maximum | 323 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 24 |
| 95-th percentile | 91.3 |
| Maximum | 323 |
| Range | 322 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 35.6465398 |
|---|---|
| Coefficient of variation (CV) | 1.687094679 |
| Kurtosis | 14.5780103 |
| Mean | 21.1289504 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 3.363946268 |
| Sum | 91594 |
| Variance | 1270.6758 |
| Value | Count | Frequency (%) | |
| 1 | 862 | 19.9% | |
| 2 | 454 | 10.5% | |
| 3 | 289 | 6.7% | |
| 4 | 206 | 4.8% | |
| 5 | 171 | 3.9% | |
| 6 | 159 | 3.7% | |
| 7 | 127 | 2.9% | |
| 8 | 110 | 2.5% | |
| 9 | 102 | 2.4% | |
| 11 | 93 | 2.1% | |
| Other values (192) | 1762 | 40.6% |
| Value | Count | Frequency (%) | |
| 1 | 862 | 19.9% | |
| 2 | 454 | 10.5% | |
| 3 | 289 | 6.7% | |
| 4 | 206 | 4.8% | |
| 5 | 171 | 3.9% |
| Value | Count | Frequency (%) | |
| 323 | 1 | < 0.1% | |
| 307 | 1 | < 0.1% | |
| 296 | 1 | < 0.1% | |
| 291 | 1 | < 0.1% | |
| 289 | 1 | < 0.1% |
| Distinct count | 763 |
|---|---|
| Unique (%) | 17.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.9 KiB |
| 2019-08-12 | 147 |
|---|---|
| 2019-08-11 | 122 |
| 2019-08-13 | 109 |
| 2019-08-10 | 84 |
| 2019-08-08 | 77 |
| Other values (758) |
| Value | Count | Frequency (%) | |
| 2019-08-12 | 147 | 3.4% | |
| 2019-08-11 | 122 | 2.8% | |
| 2019-08-13 | 109 | 2.5% | |
| 2019-08-10 | 84 | 1.9% | |
| 2019-08-08 | 77 | 1.8% | |
| 2019-08-04 | 71 | 1.6% | |
| 2019-08-25 | 63 | 1.5% | |
| 2019-08-05 | 62 | 1.4% | |
| 2019-07-29 | 59 | 1.4% | |
| 2019-08-24 | 59 | 1.4% | |
| Other values (753) | 3482 | 80.3% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
reviews_per_month
Real number (ℝ≥0)
| Distinct count | 520 |
|---|---|
| Unique (%) | 12.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1483737024221454 |
|---|---|
| Minimum | 0.01 |
| Maximum | 13.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.07 |
| Q1 | 0.23 |
| median | 0.66 |
| Q3 | 1.55 |
| 95-th percentile | 3.963 |
| Maximum | 13 |
| Range | 12.99 |
| Interquartile range (IQR) | 1.32 |
Descriptive statistics
| Standard deviation | 1.335390003 |
|---|---|
| Coefficient of variation (CV) | 1.162853172 |
| Kurtosis | 7.313129462 |
| Mean | 1.148373702 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 2.212140844 |
| Sum | 4978.2 |
| Variance | 1.783266459 |
| Value | Count | Frequency (%) | |
| 1 | 162 | 3.7% | |
| 0.12 | 74 | 1.7% | |
| 0.08 | 68 | 1.6% | |
| 0.14 | 67 | 1.5% | |
| 0.16 | 62 | 1.4% | |
| 0.15 | 61 | 1.4% | |
| 0.1 | 59 | 1.4% | |
| 0.06 | 58 | 1.3% | |
| 0.07 | 55 | 1.3% | |
| 0.17 | 54 | 1.2% | |
| Other values (510) | 3615 | 83.4% |
| Value | Count | Frequency (%) | |
| 0.01 | 2 | < 0.1% | |
| 0.02 | 23 | 0.5% | |
| 0.03 | 45 | 1.0% | |
| 0.04 | 44 | 1.0% | |
| 0.05 | 43 | 1.0% |
| Value | Count | Frequency (%) | |
| 13 | 1 | < 0.1% | |
| 12.6 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 11.03 | 1 | < 0.1% | |
| 8.37 | 1 | < 0.1% |
calculated_host_listings_count
Real number (ℝ≥0)
| Distinct count | 55 |
|---|---|
| Unique (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.424452133794695 |
|---|---|
| Minimum | 1 |
| Maximum | 274 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 12 |
| Q3 | 45 |
| 95-th percentile | 203 |
| Maximum | 274 |
| Range | 273 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 61.09287529 |
|---|---|
| Coefficient of variation (CV) | 1.549618878 |
| Kurtosis | 4.451735451 |
| Mean | 39.42445213 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.184761119 |
| Sum | 170905 |
| Variance | 3732.339411 |
| Value | Count | Frequency (%) | |
| 1 | 626 | 14.4% | |
| 2 | 392 | 9.0% | |
| 3 | 211 | 4.9% | |
| 4 | 145 | 3.3% | |
| 8 | 142 | 3.3% | |
| 203 | 133 | 3.1% | |
| 5 | 129 | 3.0% | |
| 7 | 127 | 2.9% | |
| 6 | 126 | 2.9% | |
| 14 | 119 | 2.7% | |
| Other values (45) | 2185 | 50.4% |
| Value | Count | Frequency (%) | |
| 1 | 626 | 14.4% | |
| 2 | 392 | 9.0% | |
| 3 | 211 | 4.9% | |
| 4 | 145 | 3.3% | |
| 5 | 129 | 3.0% |
| Value | Count | Frequency (%) | |
| 274 | 106 | 2.4% | |
| 203 | 133 | 3.1% | |
| 157 | 49 | 1.1% | |
| 141 | 117 | 2.7% | |
| 114 | 71 | 1.6% |
availability_365
Real number (ℝ≥0)
| Distinct count | 355 |
|---|---|
| Unique (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 238.48558246828142 |
|---|---|
| Minimum | 1 |
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 33.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 118 |
| median | 292 |
| Q3 | 353 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 235 |
Descriptive statistics
| Standard deviation | 123.401892 |
|---|---|
| Coefficient of variation (CV) | 0.5174396321 |
| Kurtosis | -1.319390867 |
| Mean | 238.4855825 |
| Median Absolute Deviation (MAD) | 71 |
| Skewness | -0.5223164462 |
| Sum | 1033835 |
| Variance | 15228.02696 |
| Value | Count | Frequency (%) | |
| 365 | 336 | 7.8% | |
| 364 | 127 | 2.9% | |
| 362 | 89 | 2.1% | |
| 359 | 88 | 2.0% | |
| 358 | 84 | 1.9% | |
| 363 | 64 | 1.5% | |
| 356 | 64 | 1.5% | |
| 361 | 63 | 1.5% | |
| 360 | 62 | 1.4% | |
| 351 | 41 | 0.9% | |
| Other values (345) | 3317 | 76.5% |
| Value | Count | Frequency (%) | |
| 1 | 10 | 0.2% | |
| 2 | 15 | 0.3% | |
| 3 | 13 | 0.3% | |
| 4 | 7 | 0.2% | |
| 5 | 7 | 0.2% |
| Value | Count | Frequency (%) | |
| 365 | 336 | 7.8% | |
| 364 | 127 | 2.9% | |
| 363 | 64 | 1.5% | |
| 362 | 89 | 2.1% | |
| 361 | 63 | 1.5% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 49091 | COZICOMFORT LONG TERM STAY ROOM 2 | 266763 | Francesca | North Region | Woodlands | 1.44255 | 103.79580 | Private room | 83 | 180 | 1 | 2013-10-21 | 0.01 | 2 | 365 |
| 1 | 1 | 50646 | Pleasant Room along Bukit Timah | 227796 | Sujatha | Central Region | Bukit Timah | 1.33235 | 103.78521 | Private room | 81 | 90 | 18 | 2014-12-26 | 0.28 | 1 | 365 |
| 2 | 2 | 56334 | COZICOMFORT | 266763 | Francesca | North Region | Woodlands | 1.44246 | 103.79667 | Private room | 69 | 6 | 20 | 2015-10-01 | 0.20 | 2 | 365 |
| 3 | 3 | 71609 | Ensuite Room (Room 1 & 2) near EXPO | 367042 | Belinda | East Region | Tampines | 1.34541 | 103.95712 | Private room | 206 | 1 | 14 | 2019-08-11 | 0.15 | 9 | 353 |
| 4 | 4 | 71896 | B&B Room 1 near Airport & EXPO | 367042 | Belinda | East Region | Tampines | 1.34567 | 103.95963 | Private room | 94 | 1 | 22 | 2019-07-28 | 0.22 | 9 | 355 |
| 5 | 5 | 71903 | Room 2-near Airport & EXPO | 367042 | Belinda | East Region | Tampines | 1.34702 | 103.96103 | Private room | 104 | 1 | 39 | 2019-08-15 | 0.38 | 9 | 346 |
| 6 | 6 | 71907 | 3rd level Jumbo room 5 near EXPO | 367042 | Belinda | East Region | Tampines | 1.34348 | 103.96337 | Private room | 208 | 1 | 25 | 2019-07-25 | 0.25 | 9 | 172 |
| 7 | 7 | 241503 | Long stay at The Breezy East "Leopard" | 1017645 | Bianca | East Region | Bedok | 1.32304 | 103.91363 | Private room | 50 | 90 | 174 | 2019-05-31 | 1.88 | 4 | 59 |
| 8 | 8 | 241508 | Long stay at The Breezy East "Plumeria" | 1017645 | Bianca | East Region | Bedok | 1.32458 | 103.91163 | Private room | 54 | 90 | 198 | 2019-04-28 | 2.08 | 4 | 133 |
| 9 | 9 | 241510 | Long stay at The Breezy East "Red Palm" | 1017645 | Bianca | East Region | Bedok | 1.32461 | 103.91191 | Private room | 42 | 90 | 236 | 2019-07-31 | 2.53 | 4 | 147 |
Last rows
| df_index | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4325 | 7697 | 37554967 | Single Bed in 6 Bed Female Dorm | 87731750 | Gap Year | Central Region | Kallang | 1.31605 | 103.85961 | Shared room | 25 | 1 | 1 | 2019-08-12 | 1.0 | 8 | 198 |
| 4326 | 7698 | 37576690 | Single Bed in 4 Bed Mixed Dorm | 87731750 | Gap Year | Central Region | Kallang | 1.31475 | 103.85874 | Shared room | 26 | 1 | 1 | 2019-08-10 | 1.0 | 8 | 189 |
| 4327 | 7699 | 37577304 | Private stylish room with ensuite bathroom | 8600508 | Helen | Central Region | Marine Parade | 1.30171 | 103.90113 | Private room | 169 | 2 | 1 | 2019-08-14 | 1.0 | 1 | 365 |
| 4328 | 7706 | 37587133 | FULLY FURNISHED 3 BEDROOM, CLEMENTI | 238891646 | Neha | West Region | Clementi | 1.31403 | 103.75964 | Entire home/apt | 344 | 3 | 1 | 2019-08-21 | 1.0 | 34 | 364 |
| 4329 | 7714 | 37619286 | Double Pod Capsule in Mixed Dorm | 87731750 | Gap Year | Central Region | Kallang | 1.31505 | 103.86022 | Shared room | 56 | 1 | 1 | 2019-08-13 | 1.0 | 8 | 121 |
| 4330 | 7715 | 37621650 | Comfortable and spacious four-bedroom family suite | 43591543 | Donald | Central Region | Geylang | 1.31410 | 103.90317 | Entire home/apt | 699 | 3 | 6 | 2019-08-23 | 6.0 | 15 | 189 |
| 4331 | 7728 | 37690516 | cozy Condominium in quite neighbourhoods | 165475492 | BOONChean | Central Region | Toa Payoh | 1.34063 | 103.88219 | Private room | 60 | 1 | 1 | 2019-08-12 | 1.0 | 1 | 1 |
| 4332 | 7752 | 37798739 | near Clementi MRT female only | 157856583 | Elyssa | West Region | Clementi | 1.30677 | 103.76224 | Private room | 56 | 1 | 1 | 2019-08-17 | 1.0 | 1 | 120 |
| 4333 | 7766 | 37841266 | Sunny Modern Condo in City Center walk to MRT | 39207304 | Sophie | Central Region | Rochor | 1.30074 | 103.84742 | Entire home/apt | 237 | 7 | 1 | 2019-08-25 | 1.0 | 12 | 159 |
| 4334 | 7767 | 37852422 | Private Room With King Size Bed Near Seng Kang MRT | 119880789 | Christine | North-East Region | Sengkang | 1.39324 | 103.89002 | Private room | 60 | 1 | 1 | 2019-08-25 | 1.0 | 1 | 298 |